Skip to content

Conversation

wuyujiji
Copy link
Contributor

在天数硬件上适配VL模型

Copy link

paddle-bot bot commented Sep 29, 2025

Thanks for your contribution!

@paddle-bot paddle-bot bot added the contributor External developers label Sep 29, 2025
@wuyujiji wuyujiji force-pushed the adapt_vl branch 6 times, most recently from b5fe3b3 to d0a687a Compare October 14, 2025 01:50
@YuanRisheng YuanRisheng requested a review from Copilot October 15, 2025 08:40
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adapts VL (Vision-Language) models for Iluvatar GPU hardware, implementing platform-specific optimizations and configurations to support multimodal inference on Iluvatar devices.

  • Pin paddleformers version to 0.3.0 for compatibility
  • Implement Iluvatar-specific attention backend optimizations for VL models
  • Add support for text-image processing operations and memory management
  • Provide comprehensive documentation with installation and usage examples

Reviewed Changes

Copilot reviewed 12 out of 12 changed files in this pull request and generated 3 comments.

Show a summary per file
File Description
requirements_iluvatar.txt Pin paddleformers to version 0.3.0 for stability
fastdeploy/worker/iluvatar_worker.py Configure Paddle flags for multimodal support
fastdeploy/worker/iluvatar_model_runner.py Add VL model initialization and rope embedding handling
fastdeploy/worker/gpu_model_runner.py Import additional Iluvatar-specific operations
fastdeploy/model_executor/models/ernie4_5_vl/modeling_resampler.py Disable fused matmul bias for Iluvatar platform
fastdeploy/model_executor/models/ernie4_5_vl/image_op.py Add Iluvatar platform support for image operations
fastdeploy/model_executor/models/ernie4_5_vl/dfnrope/modeling.py Disable fused matmul bias for Iluvatar platform
fastdeploy/model_executor/layers/rotary_embedding.py Add Iluvatar-specific rotary embedding handling
fastdeploy/model_executor/layers/attention/iluvatar_attn_backend.py Implement VL-specific attention metadata and tensor handling
docs/zh/get_started/installation/iluvatar_gpu.md Add Chinese documentation for VL model usage
docs/get_started/installation/iluvatar_gpu.md Add English documentation for VL model usage
custom_ops/setup_ops.py Register additional CUDA operations for Iluvatar support

YuanRisheng
YuanRisheng previously approved these changes Oct 15, 2025
YuanRisheng
YuanRisheng previously approved these changes Oct 16, 2025
gongshaotian
gongshaotian previously approved these changes Oct 16, 2025
Copy link
Collaborator

@gongshaotian gongshaotian left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

"gpu_ops/sample_kernels/top_k_renorm_probs.cu",
"gpu_ops/text_image_index_out.cu",
"gpu_ops/text_image_gather_scatter.cu",
"gpu_ops/extract_text_token_output.cu",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

extract_text_token_output已经被废弃,麻烦一并给删掉吧,包括算子实现、单测等等

Copy link
Contributor Author

@wuyujiji wuyujiji Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@yuanlehome 这里指的是删掉custom_ops/gpu_ops/extract_text_token_output.cu,test/operators/test_extract_text_token_output.py和cpp_extentions.cc里关于extract_text_token_output的注册?除了这三处之外,setup_ops.py里"gpu_ops/extract_text_token_output.cu"在metax_gpu里也用到了,我如果删了cu文件的实现会对这里造成影响吧

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@yuanlehome 已经删了,麻烦看下删的对不对

@@ -1,101 +0,0 @@
// Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个custom op确认没有地方使用了么

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

是的,废弃了

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

contributor External developers

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants